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Abstract 

On' 

g_) ' The methods of effective field theory are used to study generic theories of 

j-^. inflation with a single inflaton field. For scalar modes, the leading corrections 

to the IZ correlation function are found to be purely of the /j-inflation type. 

CN ' For tensor modes the leading corrections to the correlation function arise 

from terms in the action that are quadratic in the curvature, including a 

Q> , parity- violating term that makes the propagation of these modes depend on 

^1^ I their helicity. These methods are also briefly applied to non-generic theories 

• ' of inflation with an extra shift symmetry, as in so-called ghost inflation. 
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I. Generic Theories of Inflation 

Observations of the cosmic microwave background and large scale struc- 
ture are consistent with a simple theory of inflation [1] with a single canoni- 
cally normalized inflaton field ipdx), described by a Lagrangian 



^0 = y/g 






:i) 



where g = —Detg^,y, Mp = l/\/87rG is the reduced Planck mass, and V{ipc) 
is a potential down which the scalar field rolls more-or-less slowly. With this 
theory, the strength of observed fluctuations in the microwave background 
matter density indicates that the cosmic expansion rate H = a/a and the 
physical wave number k/a at horizon exit, when these are equal, have the 
value[2] H = k/a k, ^x2x lO^"^ GeV, where e is the value of -H/H'^ 
at this time, and a is the Robertson- Walker scale factor. Hence H and 
k/a at horizon exit are likely to be much less than Mp ~ 2.4 x 10^^ GeV, 
and even considerably less than a plausible grand unification scale ~ 10^^ 
GeV. This provides a justification after the fact for using a Lagrangian (1) 
with a minimum number of spacetime derivatives. (As is well known, (1) 
is the most general Lagrangian density for gravitation and a single scalar 
field with no more than two spacetime derivatives. An arbitrary function 
of if multiplying the first term could be eliminated by a redefinition of the 
metric, and an arbitrary function of if multiplying the second term could be 
eliminated by a redefinition of (/?.) 

But H and k/a at horizon exit are not entirely negligible compared with 
whatever fundamental scale characterizes the theory underlying inflation, 
and at earlier times k/a is exponentially larger than at horizon exit, so it 
is worth considering the next corrections to (1). We assume that (1) is 
just the first term in a generic effective field theory, in which terms with 
higher derivatives are suppressed by negative powers of some large mass 
M, characterizing whatever fundamental theory underlies this effective field 
theory. Rather than committing ourselves to any particular underlying the- 
ory, we will simply assume that all constants in the higher derivative terms 
of the effective Lagrangian take values that are powers of M indicated by 
dimensional analysis, with coefficients roughly of order unity. Because H 
and k/a are so large during infiation, observations of fluctuations produced 
during infiation provide a unique opportunity for detecting effects of higher 
derivative terms in the gravitational action. 



To get some idea of the value of M, we note that the unperturbed canon- 
ically normahzed scalar field (pc described by the Lagrangian (1) has a time 
derivative (p^ = \/2eMpH, so the change in (pc during a Hubble time 1/ H at 
around the time of horizon exit is of order (p^/H = y2eMp. If we are to use 
effective field theory to study fluctuations at about the time of horizon exit 
in generic theories in which the dependence of the action on ipc is uncon- 
strained by symmetry principles or by other consequences of an underlying 
theory, and if (1) is at least a fair first approximation to the full theory, then 
the mass M that is characteristic of the effective field theory of inflation can 
not be much smaller than \pTeMp^ for if it were then there would be no limit 
on the size of higher-derivative terms containing many powers of (pc/M. It 
follows that the expansion parameter H/M in this class of theories is no 
greater than H/\/2eMp ~ 6 x 10~^, whatever the value of e. 

We will tentatively assume here that M is of order \/2eMp, in which case 
the coefficients of the higher-derivative terms in the effective Lagrangian 
have to be taken as arbitrary functions of ipc/M. This is likely to be the 
case if e is not too small, say of order 0.02, since then there is not much 
difference between \/2eMp and Mp, and M is unlikely to be much larger 
than Mp. (The considerations presented below would still be valid if M 
were instead much larger than \/2€Mp, as for instance if M ss Mp and e 
is very small, but then we would have to count powers of ^d^ as well as 
numbers of derivatives in judging how much the various higher-derivative 
terms are suppressed, and some of the coefficient functions in the effective 
Lagrangian derived below would be negligible, and all others very simple.) 
From now on we will work with a dimensionless scalar field i-p = ipc/M, and 
write (1) as 



^0 = Vo 



-fR - —g^-d^^d,^ - MlU{ip) 



where U{'p) = V{Mip)/Mp. Note that the unperturbed value of C/ is (3 — 
e)i?^, so we can think of U as well as d^ipd^ip as both being of order H^ at 
horizon exit. 

The leading correction to (2) will consist of a sum of all generally covari- 
ant terms with four spacetime derivatives and coefficients of order unity [3]. 
By a judicious weeding out of of total derivatives, the most general such 
correction term can be put in the form [4] 



A£ = ^ 



/i(v9)(5^>,^(^,,)' + /2(v?)/V,pV?,.n^ 



+ho{vV''C^,^''Cp,^x , (3) 

where as usual commas denote ordinary derivatives and semicolons denote 
covariant derivatives; Qip = g^'^ip^f^-u is the invariant d'Alembertian of ip; 
^jiupa jg ^]^g totally antisymmetric tensor density with e^^so = -)-ij and the 
fnif) are dimensionless functions, treated here as of order unity. In the last 
two terms, instead of the Riemann-Christoffel tensor R^upai we have used 
the Weyl tensor 

^pupa ^= ^pupa '^\9pp^V(T 9pcr^vp 9i'p^pa~T~guiT^pp j I'TT \9pp9uiT 9vp9pcT 

(4) 
Writing the last two terms in Eq. (3) as bilinears in C^^pa rather than Rfj,upa 
has no effect in the last term, and in the penultimate term of course just 
amounts to a different definition of /y and /g. (Similarly, instead of writing 
the penultimate term as a bilinear in Cp^jypa or R^upa-, we could have written 
it as the linear combination of curvature bilinears that appears in the Gauss- 
Bonnet identity; even though this linear combination is a total derivative, it 
would affect the field equations because its coefficient fg{f) is not constant.) 
Our reason for choosing to use the Weyl tensor in the last two terms will 
become apparent soon. 

The correction term (3) involves second time derivatives, as well as fields 
and their first time derivatives. If we took such a theory literally, we would 
find more than just the usual two adiabatic modes for single-field inflation, 
and the commutation relations (as given by the Ostrogradski formalism[5]) 
would be bizarre, with cp commuting with ip. (For instance, Kallosh, Kang, 
Linde, and Mukhanov[6] encounter such additional modes when the Ostro- 
gradski formalism is applied to a scalar field Lagrangian involving second 
time derivatives.) Similarly, there are metric components (such as g^^ and 
g^"^ in the ADM formalism [7]) whose time derivatives do not appear in Cq, 
but that do appear in AC If we were to take Cq + AC as the full Lagrangian, 
then the correction term AC would cause these auxiliary fields to become 
dynamical, with a further expansion of the modes of the system. 

Instead, we should remember that from the point of view of effective field 
theory, Eqs. (2) and (3) represent just the lowest two terms in an expansion 



in inverse powers of M, so we must rule out any modes that cannot be 
expanded in this way [8]. This means in particular that we must eliminate 
all second time derivatives and time derivatives of auxiliary fields in the first 
correction terms in the effective action by using the field equations derived 
from the leading terms in the action.^ In the present case, we must eliminate 
second time derivatives and time derivatives of auxiliary fields in (3) by using 
the zero-th order field equations derived from (2): 

M^Dip = MlU'iip) , R^, = -(mVm|,)v9,^99,, - Uiip)g^,, . (5) 

Using these field equations in Eq. (3) allows us, with some redefini- 
tions, to eliminate all of the terms in (3) except the first one and the last 
two. Specifically, the second term in Eq. (3) just provides a field-dependent 



This is equivalent to what is generaUy done in deriving Feynman rules in effective flat- 
space quantum field theories. Consider for instance the very simple effective Lagrangian 

where M S> m is some very large mass, and J is a c-number external current. We can 
easily find the connected part F of the vacuum persistance amplitude; 

\J(k)\' 



fc2 + m2 + k^/M^ 



If we took this result seriously, then we would conclude that in addition to the usual 
particle with mass m + 0{m^ /M^), the theory contains an unphysical one particle state 
with mass M + 0{m^ / M) . But if we regard C as just the first two terms in a power series 
in l/M'^, then we must treat the term M~'^{Uipf' as a first-order perturbation, so that 
the vacuum persistence amplitude is 



F = i / d*k\J{k)\' 



1 k" 



k^+m'2 AP{k'^ + m^)^ 

and the only pole is at k^ = —m^. This is just the same result for F that we would find 
if we were to eliminate the second time derivatives in the 0{M~^) term in C by using the 
field equation derived from the leading term in the Lagrangian 

Oifi = m ip — J . 

In this case the effective Lagrangian becomes 

C = -^[d^ipd^ifi + mV^ + m^Af "V^] + (1 + m'^/M^)Jifi - j'^ /2M^ . 

Taking into account all J-dependent terms, it is straightforward to see that with this 
Lagrangian we get the same vacuum persistence amplitude as found above for the the 
original Lagrangian, when M^^{0(p)'^ is treated as a first-order perturbation. 



correction to the kinematic term in (2), which can be ehminated by a re- 
definition of the infiaton field; the third term just provides a correction 
/3C/' Mp/M^ to the potential in (2), which can be absorbed into a redefi- 
nition of U{ip); the fourth and fifth terms supply corrections to both fi{ip) 
and the kinematic term in (2); the sixth term provides corrections to the 
kinematic term and the potential in (2); and the seventh and eighth terms 
provide corrections to the kinematic term and potential in (2) and to fi{^)- 
That is, with suitable redefinitions of f, U{f), and fi{(p), and with various 
total derivatives dropped, the Lagrangian is the sum of (2) and a correction 
term of the form 

(6) 
The first term is of the type encountered in theories of "fe-inflation" [9]. This 
term must be included in the Lagrangian, as a counterterm to ultraviolet 
divergences encountered when the leading terms in (2) are used in one-loop 
order. The second term (or an equivalent Gauss-Bonnet term) has been 
considered in connection with inflation and the evolution of dark energy [10]. 

For a general function fioif) the final term in Eq. (6) violates par- 
ity conservation [11]. That is, although the action is invariant under co- 
ordinate transformations x^ -^ x'^ that are "small," in the sense that 
T>ei{dx' / dx) > 0, it is not invariant under inversions, that is, under co- 
ordinate transformations with T)et{dx' /dx) < 0. It is only invariance under 
"small" coordinate transformations that is needed to ensure the conserva- 
tion of the energy-momentum tensor, and no sequence of "small" coordinate 
transformations can ever add up to an inversion, so there is no a priori rea- 
son to impose invariance under inversions, including space inversion. The 
fact that parity has always been observed to be conserved in gravitational 
interactions is sufficiently explained by the fact that terms in the effective 
action for gravity and scalars with no more than two spacetime derivatives 
that are invariant under "small" coordinate transformations cannot be com- 
plicated enough to violate invariance under inversions. 

From now on we shall work in perturbation theory, writing 

9fiu{^,t) = g^,y{t) + h^^{iii,t) , ip{iii,t) =ip{t) + Sip{y.,t) , (7) 

where g^u{t) is the flat-space Robertson- Walker metric with ^00 = ~1) 5oi = 
0, and gij = a?'{t)5ij; ^{t) is the unperturbed scalar field; and h^i^ and Sip 
are first-order perturbations. In this paper we will mostly be concerned with 



the terms in the Lagrangian that are quadratic in perturbations, which are 
needed for the calculation of Gaussian correlations. Terms of higher order 
in perturbations that are needed for the calculation of non-Gaussian effects 
will be considered only briefly. 

Because the spatially flat Robertson-Walker metric is also conformally 
flat it has a vanishing Weyl tensor, and so the Weyl tensor starts with a 
term of first order in perturbations. This saves us from having to calculate 
the Weyl tensor to second order in perturbations; we have simply 



[AC] 



(2) 



V9fi{^)[9^''y^,^v,u 



(2) 



where the superscripts (1) and (2) denote terms of first and second order 
in perturbations, respectively. Furthermore, the Weyl tensor is traceless, 
which to first order in perturbations gives 

Since scalar and tensor fluctuations do not interfere in Gaussian correlations, 
they will be considered separately. 

II. Scalar Fluctuations 

Here we are interested in terms in the Lagrangian that, after eliminating 
auxiliary fields, are quadratic in TZ, the familiar gauge-invariant quantity 
that is conserved outside the horizon [1] 

n^--^^, 10 

2 if> 

with A defined by writing the spatial part of the metric perturbation for 
scalar perturbations in a general gauge as 



hij{x,t) = a'^{t) 



6ijA{x,t)+ "■ ' 



^'^ ' dx^dx^ 



(11) 



Let us consider in turn the contribution of the three terms in Eq. (8) to the 
quadratic part of the Lagrangian for TZ. 

First, terms in the effective Lagrangian like the first term in Eq. (8) 
that depend only on (p and d^cpd^ip are known to enter into the part of the 

7 



Lagrangian quadratic in scalar fluctuations only through their effect on the 
sound speed Cs(t)[12]. That is, after eliminating auxiliary fields, 

(2) 



V9MJ> 



R + ^P[-d^ipd>'ip/2,^ 



MlH o 



1 7t2 - hviZf 
ci a^ 



where 



1 + 2 



X 



d^Pix,^) dPix,^) 



dx^ 



dX 



(12) 
(13) 



JX=i^V2 

In particular, the first term in Eq. (8) shifts the squared speed of sound by 



Act 



M* 



corresponding to a second-order perturbation 



V9fii^){9^''f,^V,u 



(2) 



16MV^ff^/iM-7?2 

M4F2 



(14) 



(15) 



The other terms in Eq. (8) are greatly simplified by noting that, for scalar 
modes, Cijko must take the form of Si^dj — dj^di acting on some scalar, so 



.(1) 



.(1) 



the above trace condition C^.L = implies that ClA^ = 0. Hence all we 
need to evaluate the second term in (8) are the purely spatial components of 
the Weyl tensor. Using the field equations (5) to eliminate auxiliary fields 
and second time derivatives, we find after a straightforward though tedious 
calculation that 



a'um'^r^r'r^c^rC^^l 



■iXri( fJ.iypcr 



a-'f9{'p) 






'sip 



o?h{m^ 



(16) 



Comparing this with Eq. (15), we see that the effect of the second term in 
Eq. (8) is the same as a change in the coefficient /i of the first term by an 
amount 



SMt 



.(1) 



Finally, because C^^'V^ vanishes for scalar modes, the last term in Eq. (8) 



IS 






^" '^ijlm^kOlm 



^'-'ijlo'^kOlO 



0. 



(18) 



We conclude that the leading corrections to the Gaussian correlations of 
TZ are solely of the k-inflation type. This justifies the calculation of the 
effective Lagrangian for Gaussian scalar correlations in slow roll inflation 
in Section 3 of reference 3 even for generic theories of inflation. In such 
theories the terms in Eq. (3) that are left out in reference 3 can indeed 
be omitted in calculating the part of the effective Lagrangian quadratic in 
scalar fluctuations, not because they are small, but because as we have seen 
for scalar Gaussian fluctuations they yield nothing new. But this is not the 
case for Gaussian tensor fluctuations, and does not seem to be the case when 
non-Gaussian correlations are considered. 

We have so far only considered the terms in the effective action of sec- 
ond order in TZ, which are needed to calculate Gaussian correlations, but 
for actions of the /c-inflation type, which only involve first derivatives of 
fields, it is not difficult also to calculate terms in the action of higher or- 
der in TZ, which generate non-Gaussian correlations. For this purpose it 
is convenient to adopt a gauge in which there are no scalar perturbations 
to gij] that is, in which gij = o^(f)[exp(D(x, t))]ij, where Dij is a gravita- 
tional wave amplitude with Da = and diDij = 0. In this gauge, Eq. (10) 
gives 7^ = —H5ip/(p. If we tentatively ignore the interaction of the infiaton 
with gravitational perturbations, and assume that H, /i, and (p are varying 
slowly, then it is trivial, by simply setting ip equal to (p + 5(p in £, and using 
H = -0^(M2+4/ic^^)/2M|,, to write a Lagrangian for vr = -TZ/H = bLpjip: 



V~9 



-^gf^-d^p^d,^ - Mlu{^) + fi{p^){g^''d^p^d,^f 



2 
C + a^M],H (-n^ + a'^iVirf 



This agrees with the result obtained in Eq. (28) of [3], except that here we 
include terms quartic in vr. In [3] the neglect of interactions of the infiaton 
with gravitational perturbations is justified on the basis of a "high energy" 
approximation, which amounts to the usual slow-roll approximation that 
e <C 1, plus the assumption that, in our terms, M^ ^ eHMp, which is 
much weaker than the assumption M ^ \/2eMp that we found necessary 
to treat generic theories of infiation by the methods of effective field theory. 
We see that the non-quadratic terms in Cq that can generate non-Gaussian 
correlations are suppressed in the slow roll approximation, as found by Mai- 



dacena[13], but in AL the coefficients of the quadratic and higher order 
terms are of the same order of magnitude. 

III. Tensor Fluctuations 

Tensor fluctuations appear solely in the perturbation to the purely spa- 
tial metric: 



hij{x,t) = a^it) expD ..(x,t), Da = , 



diDij 







(20) 



with 5(p = 0. The first term in Eq. (8) involves only the metric components 
g^^ and DetQf^p, so it gets no contribution from tensor fluctuations. On 
the other hand, here the second and third terms in (8) make a non-trivial 
contribution to the Lagrangian for Dij . Another straightforward calculation 
(dropping total derivatives) gives these terms as 






^" '-'ijko'-'ijkO + ^^ '-"iOfco'-'iOfcO 



a^h{^){ Dik 2H^ + 2VVa']Afe - 4i7Afc(VVa')A 



ik 



+ 2Afc(vV«')A 



ik 



(211 



and 



fiome^'''''r'^-9'^cfj^,^cfi. 



/io('/5)e 



ijkO 



.4^(1) ^(1) Q„-2^(l) ^(1) ■ 



4ho{^)e'"'''-^^[DudjV^DM 



(22) 



The field equation for the tensor mode (with the term proportional to /g 
dropped for simplicity) is then 



64^G/ioa-=^(e'^''°av2A, + e'^'^Oav^A. 



Du + 3HDu - {V^/a^)Du -- 

(23) 
For a plane wave with co-moving wave number k in the 3-direction, the only 
non- vanishing tensor amplitudes are Du = — -D22 and D12 = D2i- They 
satisfy the field equations 

D± + 3HD± + {k'^/a^)D± = ^l2S^G{k/af JwD± (24) 

where D± = Du =p iDi2 are the amplitudes with helicity ±2. As found in 
ref. [11], the wave equation depends on helicity because parity is violated. 
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IV. A Non-Generic Example: Ghost Inflation 

Up to now, we have been concerned with generic theories of inflation, in 
which the dependence of the action on the inflaton field is unconstrained, 
and in consequence the characteristic mass M cannot be taken to be much 
less than ^/eMp. For an example of a different sort, we might impose on the 
action a shift symmetry, under a transformation Lp -^ c/?+constant, which 
requires that the Lagrangian density involve only spacetime derivatives of 
(/? rather than Lp itself. This possibility was discussed briefly in [9], and in 
more detail under the name "ghost inflation" in [14]. We will take (/? to be 
normalized so that 5^c/? is dimensionless, and has an unperurturbed value 
at horizon exit of order unity. The term in the Lagrangian density that 
depends only on d^i-p is then 

£o = M^^gP{-d^^d^^) , (25) 

where P{X) is a power series in X, with coefficients assumed to be of order 
unity, and M is the characteristic mass of the theory. Since powers of 99 are 
excluded by the shift symmetry, M here can be much smaller than in generic 
theories of inflation, and in particular we will assume that M is much less 
than the Planck mass Mp. Any additional derivatives acting on d^ip or on 
the metric yield factors of order H ~ M'^ /Mp <^ M, so Eq. (25) along with 
the Einstein term can be taken as the leading term in C, with any correction 
terms suppressed by factors of H/M. 

Let us first consider a theory in which (25) is the whole Lagrangian 
density for the scalar field, with no higher-derivative corrections. The field 
equation for the unperturbed scalar field <p{t) in this theory is 

j^{a'P'{^')^)=0. (26) 

As noted in [9], in the limit of late time when a ^ 00, either c^ ^ 0, or 
(^ — > f , where u is a quantity of order unity satisfying P'(w^) = 0. We will 
consider only the latter case. In ref. [14] the limit ip = vt is supposed to 
be already reached, in which case interesting fluctuations occur only when 
higher-derivative correction terms are added to (25). But if we take (p{t) to 
be only close to vt, but not yet there, then we find a non-trivial spectrum of 
propagating fluctuations even when no correction terms are added to (25). 
In this case the solution of Eq. (26) (with an appropriate normalization of 
a(t)) has (p -^ v+a~^; the speed of sound is Cg -^ yJTJva^; the expansion rate 
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approaches a limit H^o = {M'^ /Mp)yJ P{v^)/'i; and the Fourier transform 
of IZ is 

n, c a-y^H^;i (TTr^j^TTJ,) , (27) 



3/H5i?oo^l/2a5/2 

with a /c-independent constant of proportionahty. At late times, when the 
perturbation wave length is outside the acoustic horizon, this approaches a 
time-independent quantity 7^^, with 

\ni\^ oc fc-6/5 , (28) 

corresponding to a conventional scalar slope index ns = 14/5, which of 
course is empirically ruled out. Thus to have a realistic theory of this sort, 
we must consider corrections to the leading term (25). 

The first correction to this Lagrangian density contains just one factor 
of a second derivative of 99, and in general is of the form 

A£ = M3^Q(-a^99a^ (/?)□(/? , (29) 

and is therefore suppressed relative to (25) by factors of order H/M ~ 
M/Mp. (A term proportional to g'"^g'^ (p^K(p^xip^p-a can be put in the form 
(29) by adding suitable total derivatives. In ref. [14] these terms were 
excluded by imposing an additional symmetry under the reflection cp ^ —cp, 
in which case the first correction is quadratic rather than linear in second 
derivatives of c^.) Once again, we must eliminate the second time derivatives 
in AC by setting (/3 equal to the same quantity as given by the field equation 
derived from the leading part of the Lagrangian 

P'{-d^ipd''ip)aip - 2P"{-dy^d''^){du^).^d^^d''^ = . (30) 

This is pretty complicated, so for simplicity let us consider the case of a 
metric fixed in the flat-space Robertson- Walker form. Then after using (30) 
to eliminate second time derivatives in (29), the correction term is 

r3^3 ( '^QP 



+a^'^HLpdi(pdi(p + a^^diLpdjLpdidjLp + 2>Hip^ — a^"^ ip'^V'^ ip) (31) 

where Q, P' and P" all have arguments —d^ipd^ip = ip^ — a~'^diipdi(p. This is 
the correction that has to be added to Cq in order to find the commutation 
relations of the field as well as the field equations by canonical quantization. 
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